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(57) Abstract 



In a method and device for recovering the high frequency content of a wideband signal previously down-sampled during encoding, 
and for injecting, during decoding, this high frequency content in an over-sampled synthesized version of the wideband signal to produce a 
full-spectrum synthesized wideband signal, a white noise generator produces a white noise sequence. Serially interconnected gain adjustment 
unit, spectral shaper and band-pass filter spectrally shapes the white noise sequence in relation to a set of shaping parameters representative 
of the down-sampled wideband signal such as a voicing factor, an energy scaling factor, a tilt scaling factor, and linear prediction filter 
coefficients. A signal injection circuit finally injects the spectrally-shaped white noise sequence in the over-sampled synthesized signal 
version to thereby produce the full-spectrum synthesized wideband signal. 
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HIGH FREQUENCY CONTENT RECOVERING METHOD AND 
DEVICE FOR OVER-SAMPLED SYNTHESIZED WIDEBAND SIGNAL 

5 

BACKGROUND OF THE INVENTION 

1. Field of the invention: 

10 

The present invention relates to a method and device for 
recovering a high frequency content of a wideband signal previously 
down-sampled, and for injecting this high frequency content in an over- 
sampled synthesized version of the down-sampled wideband signal to 
15 produce a full-spectrum synthesized wideband signal. 

2. Brief description of the prior art: 

20 The demand for efficient digital wideband speech/audio 

encoding techniques with a good subjective quality/bit rate trade-off is 
increasing for numerous applications such as audio/video 
teleconferencing, multimedia, and wireless applications, as well as 
Internet and packet network applications. Until recently, telephone 

25 bandwidths filtered in the range 200-3400 Hz were mainly used in speech 
coding applications. However, there is an increasing demand for 
wideband speech applications in order to increase the intelligibility and 
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naturalness of the speech signals. A bandwidth in the range 50-7000 Hz 
was found sufficient for delivering a face-to-face speech quality. For 
audio signals, this range gives an acceptable audio quality, but still lower 
than the CD quality which operates on the range 20-20000 Hz. 

5 A speech encoder converts a speech signal into a digital 

bitstrearn which is transmitted over a communication channel (or stored 
in a storage medium). The speech signal is digitized (sampled and 
quantized with usually 16-bits per sample) and the speech encoder has 
the role of representing these digital samples with a smaller number of 
10 bits while maintaining a good subjective speech quality. The speech 
decoder or synthesizer operates on the transmitted or stored bit stream 
and converts it back to a sound signal. 

One of the best prior art techniques capable of achieving a 
15 good quality/bit rate trade-off is the so-called Code Excited Linear 
Prediction (CELP) technique. According to this technique, the sampled 
speech signal is processed in successive blocks of L samples usually 
called frames where L is some predetermined number (corresponding to 
10-30 ms of speech). In CELP, a linear prediction (LP) synthesis filter is 
20 computed and transmitted every frame. The L-sample frame is then 
divided into smaller blocks called subframes of size of N samples, where 
L=kN and k is the number of subframes in a frame (N usually corresponds 
to 4-10 ms of speech). An excitation signal is determined in each 
subframe, which usually consists of two components: one from the past 
25 excitation (also called pitch contribution or adaptive codebook) and the 
other from an innovative codebook (also called fixed codebook). This 
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excitation signal is transmitted and used at the decoder as the input of the 
LP synthesis filter in order to obtain the synthesized speech. 

An innovative codebook in the CELP context, is an indexed set 
of A/-sample-long sequences which will be referred to as A/-dimensional 
5 codevectors. Each codebook sequence is indexed by an integer k 
ranging from 1 to M where M represents the size of the codebook often 
expressed as a number of bits b, where M=2 b . 

To synthesize speech according to the CELP technique, each 
1 0 block of N samples is synthesized by filtering an appropriate codevector from 
a codebook through time varying filters modeling the spectral characteristics 
of the speech signal. At the encoder end, the synthesis output is computed 
for all, or a subset, of the codevectors from the codebook (codebook search). 
The retained codevector is the one producing the synthesis output closest 
15 to the original speech signal according to a perceptually weighted distortion 
measure. This perceptual weighting is performed using a so-called 
perceptual weighting filter, which is usually derived from the LP synthesis 
filter. 

20 The CELP model has been very successful in encoding 

telephone band sound signals, and several CELP-based standards exist in 
a wide range of applications, especially in digital cellular applications. In the 
telephone band, the sound signal is band-limited to 200-3400 Hz and 
sampled at 8000 samples/sec. In wideband speech/audio applications, the 

25 sound signal is band-limited to 50-7000 Hz and sampled at 16000 
samples/sec. 
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Some difficulties arise when applying the telephone-band 
optimized CELP model to wideband signals, and additional features need to 
be added to the model in order to obtain high quality wideband signals. 
Wideband signals exhibit a much wider dynamic range compared to 
telephone-band signals, which results in precision problems when a fixed- 

5 point implementation of the algorithm is required (which is essential in 
wireless applications). Further, the CELP model will often spend most of its 
encoding bits on the low-frequency region, which usually has higher energy 
contents, resulting in a low-pass output signal. To overcome this problem, 
the perceptual weighting filter has to be modified in order to suit wideband 

10 signals, and pre-emphasis techniques which boost the high frequency 
regions become important to reduce the dynamic range, yielding a simpler 
fixed-point implementation, and to ensure a better encoding of the higher 
frequency contents of the signal. Further, the pitch contents in the spectrum 
of voiced segments in wideband signals do not extend over the whole 

1 5 spectrum range, and the amount of voicing shows more variation compared 
to narrow-band signals. Thus, it is important to improve the closed-loop pitch 
analysis to better accommodate the variations in the voicing level. 

Some difficulties arise when applying the telephone-band 
20 optimized CELP model to wideband signals, and additional features need to 
be added to the model in order to obtain high quality wideband signals. 

As an example, in order to improve the coding efficiency and 
reduce the algorithmic complexity of the wideband encoding algorithm, the 
25 input wideband signal is down-sampled from 16 kHz to around 12.8 kHz. 
This reduces the number of samples in a frame, the processing time and the 
signal bandwidth below 7000 Hz to thereby enable reduction in bit rate down 
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to 12 kbit/s while keeping very high quality decoded sound signal. The 
complexity is also reduced due to the lower number of samples per speech 
frame. At the decoder, the high frequency contents of the signal needs to 
be reintroduced to remove the low pass filtering effect from the decoded 
synthesized signal and retrieve the natural sounding quality of wideband 
5 signals. For that purpose, an efficient technique for recovering the high 
frequency content of the wideband signal is needed to thereby produce a 
full-spectrum wideband synthesized signal, while maintaining a quality close 
to the original signal. 

10 

OBJECT OF THE INVENTION 

An object of the present invention is therefore to provide such an 
1 5 efficient high frequency content recovery technique. 

SUMMARY OF THE INVENTION 

20 

More specifically, in accordance with the present invention, there 
is provided a method for recovering a high frequency content of a wideband 
signal previously down-sampled and for injecting the high frequency content 
in an over-sampled synthesized version of the wideband signal to produce 
25 a full-spectrum synthesized wideband signal. This high-frequency content 
recovering method comprises: generating a noise sequence; spectrally- 
shaping the noise sequence in relation to shaping parameters representative 
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of the down-sampled wideband signal; and injecting the spectrally-shaped 
noise sequence in the over-sampled synthesized signal version to thereby 
produce the full-spectrum synthesized wideband signal. 

The present invention further relates to a device for recovering a 
5 high frequency content of a wideband signal previously down-sampled and 
for injecting this high frequency content in an over-sampled synthesized 
version of the wideband signal to produce a full-spectrum synthesized 
wideband signal. This high-frequency content recovering device comprises 
a noise generator for producing a noise sequence, a spectral shaping unit 
10 for shaping the noise sequence in relation to shaping parameters 
representative of the down-sampled wideband signal, and a signal injection 
circuit for injecting the spectrally-shaped noise sequence in the over-sampled 
synthesized signal version to thereby produce the full-spectrum synthesized 
wideband signal. 

15 

In accordance with a preferred embodiment, the noise sequence 
is a white noise sequence. 

Preferably, spectral shaping of the noise sequence comprises: 
20 producing a scaled white noise sequence in response to the white noise 
sequence and a first subset of the shaping parameters; filtering the scaled 
white noise sequence in relation to a second subset of the shaping 
parameters comprising bandwidth expanded synthesis filter coefficients to 
produce a filtered scaled white noise sequence characterized by a frequency 
25 bandwidth generally higher than a frequency bandwidth of the over-sampled 
synthesized signal version; and band-pass filtering the filtered scaled white 
noise sequence to produce a band-pass filtered scaled white noise 
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sequence to be subsequently injected in the over-sampled synthesized 
signal version as the spectrally-shaped white noise sequence. 

Still according to the present invention, there is provided a 
decoder for producing a synthesized wideband signal, comprising: 
5 a) a signal fragmenting device for receiving an encoded version 

of a wideband signal previously down-sampled during encoding and 
extracting from the encoded wideband signal version at least pitch codebook 
parameters, innovative codebook parameters, and synthesis filter 
coefficients; 

10 b) a pitch codebook responsive to the pitch codebook parameters 

for producing a pitch codevector; 

c) an innovative codebook responsive to the innovative codebook 
parameters for producing an innovative codevector, 

d) a combiner circuit for combining the pitch codevector and the 
1 5 innovative codevector to thereby produce an excitation signal; 

e) a signal synthesis device including a synthesis filter for filtering 
the excitation signal in relation to the synthesis filter coefficients to thereby 
produce a synthesized wideband signal, and an oversampler responsive to 
the synthesized wideband signal for producing an over-sampled signal 

20 version of the synthesized wideband signal; and 

f) a high-frequency content recovering device as described 
hereinabove, for recovering a high frequency content of the wideband signal 
and for injecting the high frequency content in the over-sampled signal 
version to produce the full-spectrum synthesized wideband signal. 

25 

In accordance with a preferred embodiment, the decoder further 
comprises: 
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10 



a) a voicing factor generator responsive to the adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to the 
gain adjustment module; 

b) an energy computing module responsive to the excitation signal 
for calculating an excitation energy for forwarding to the gain adjustment 
module; and 

c) a spectral tilt calculator responsive to the synthesized signal for 
calculating a tilt scaling factor for forwarding to the gain adjustment module. 
The first subset of the shaping parameters comprises the voicing factor, the 
energy scaling factor, and the tilt scaling factor, and the second subset of the 
shaping parameters includes linear prediction coefficients. 



15 



In accordance with other preferred embodiments of the decoder: 



- the voicing 
relation: 



factor generator calculates the voicing factor r v using the 



r v = (E v - E c ) I (E v + E c ) 

where E v is the energy of the gain scaled pitch codevector and E c is the 
20 energy of the gain scaled innovative codevector; 

- the gain adjusting unit calculates an energy scaling factor using the relation: 



25 



Energy scaling factor = 



\ 



A/-1 

n=Q 

AT-1 

E w ' 2 ( n ) 

n=0 



n=0 AM. 
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where w' is the white noise sequence and u' is an enhanced excitation 
signal derived from the excitation signal; 

- the spectral tilt calculator calculates the tilt scaling factor g, using the 
relation: 

5 g l ='\-titt bounded by 0.2 s g, <;1 .0 

where 

E s h <»> s h (" " 1 > conditioned by f//f * 0 and tilt :> r v . 

tilt = — 

or the relation: 

15 g,=10 0 6 "" bounded by 0.2 * g t *1.0 

where 

20 £s„ (»> s » < n ~ 1 > 

tf/f = — , conditioned by tilt > 0 and tilt :> 

E V <") 

n=0 



25 



Preferably, the band-pass filter has a frequency bandwidth located 
between 5.6 kHz and 7.2 kHz. 
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Also according to the present invention, in a decoder for 
producing a synthesized wideband signal, comprising: 

a) a signal fragmenting device for receiving an encoded version 
of a wideband signal previously down-sampled during encoding and 
extracting from the encoded wideband signal version at least pitch codebook 

5 parameters, innovative codebook parameters, and synthesis filter 
coefficients; 

b) a pitch codebook responsive to the pitch codebook parameters 
for producing a pitch codevector; 

c) an innovative codebook responsive to the innovative codebook 
1 0 parameters for producing an innovative codevector; 

d) a combiner circuit for combining the pitch codevector and the 
innovative codevector to thereby produce an excitation signal; and 

e) a signal synthesis device including a synthesis filter for filtering 
the excitation signal in relation to the synthesis filter coefficients to thereby 

1 5 produce a synthesized wideband signal, and an oversampler responsive to 
the synthesized wideband signal for producing an over-sampled signal 
version of the synthesized wideband signal; 

the improvement comprising a high-frequency content recovering device as 
described hereinabove for recovering a high frequency content of the 
20 wideband signal and for injecting the high frequency content in the over- 
sampled signal version to produce the full-spectrum synthesized wideband 
signal. 

The present invention finally comprises a cellular communication 
25 system, a cellular mobile transmitter/receiver unit, a cellular network element, 
and a bidirectional wireless communication sub-system comprising the 
above described decoder. 



WO 00/25305 



PCT/CA99/00990 



11 

The objects, advantages and other features of the present 
invention will become more apparent upon reading of the following non 
restrictive description of a preferred embodiment thereof, given by way of 
example only with reference to the accompanying drawings. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the appended drawings: 

10 

Figure 1 is a schematic block diagram of a preferred embodiment of 
wideband encoding device; 

Figure 2 is a schematic block diagram of a preferred embodiment of 
15 wideband decoding device; 

Figure 3 is a schematic block diagram of a preferred embodiment of 
pitch analysis device; and 

20 Figure 4 is a simplified, schematic block diagram of a cellular 

communication system in which the wideband encoding device of Figure 1 
and the wideband decoding device of Figure 2 can be used. 



25 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
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As well known to those of ordinary skill in the art, a cellular 
communication system such as 401 (see Figure 4) provides a 
telecommunication service over a large geographic area by dividing that 
large geographic area into a number C of smaller cells. The C smaller cells 
are serviced by respective cellular base stations 402 1f 402 2 ... 402 c to 
5 provide each cell with radio signalling, audio and data channels. 

Radio signalling channels are used to page mobile radiotelephones 
(mobile transmitter/receiver units) such as 403 within the limits of the 
coverage area (cell) of the cellular base station 402, and to place calls to 
1 0 other radiotelephones 403 located either inside or outside the base station's 
cell or to another network such as the Public Switched Telephone Network 
(PSTN) 404. 

Once a radiotelephone 403 has successfully placed or received a 
15 call, an audio or data channel is established between this radiotelephone 
403 and the cellular base station 402 corresponding to the cell in which the 
radiotelephone 403 is situated, and communication between the base 
station 402 and radiotelephone 403 is conducted over that audio or data 
channel. The radiotelephone 403 may also receive control or timing 
20 information over a signalling channel while a call is in progress. 

If a radiotelephone 403 leaves a cell and enters another adjacent cell 
while a call is in progress, the radiotelephone 403 hands over the call to an 
available audio or data channel of the new cell base station 402. If a 
25 radiotelephone 403 leaves a cell and enters another adjacent cell while no 
call is in progress, the radiotelephone 403 sends a control message over the 
signalling channel to log into the base station 402 of the new cell. In this 
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manner mobile communication over a wide geographical area is possible. 

The cellular communication system 401 further comprises a control 
terminal 405 to control communication between the cellular base stations 
402 and the PSTN 404, for example during a communication between a 
5 radiotelephone 403 and the PSTN 404, or between a radiotelephone 403 
located in a first cell and a radiotelephone 403 situated in a second cell. 

Of course, a bidirectional wireless radio communication subsystem 
is required to establish an audio or data channel between a base station 402 
10 of one cell and a radiotelephone 403 located in that cell. As illustrated in 
very simplified form in Figure 4, such a bidirectional wireless radio 
communication subsystem typically comprises in the radiotelephone 403: 

- a transmitter 406 including: 

- an encoder 407 for encoding the voice signal; and 

15 - a transmission circuit 408 for transmitting the encoded voice 

signal from the encoder 407 through an antenna such as 409; 
and 

- a receiver 410 including: 

- a receiving circuit 41 1 for receiving a transmitted encoded voice 
20 signal usually through the same antenna 409; and 

- a decoder 412 for decoding the received encoded voice signal 
from the receiving circuit 41 1 . 

The radiotelephone further comprises other conventional 
25 radiotelephone circuits 41 3 to which the encoder 407 and decoder 41 2 are 
connected and for processing signals therefrom, which circuits 413 are well 
known to those of ordinary skill in the art and, accordingly, will not be further 
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described in the present specification. 

Also, such a bidirectional wireless radio communication subsystem 
typically comprises in the base station 402: 

- a transmitter 414 including: 

5 - an encoder 41 5 for encoding the voice signal; and 

- a transmission circuit 416 for transmitting the encoded voice 
signal from the encoder 415 through an antenna such as 417; 
and 

- a receiver 41 8 including: 

10 - a receiving circuit 41 9 for receiving a transmitted encoded voice 

signal through the same antenna 417 or through another antenna 
(not shown); and 

- a decoder 420 for decoding the received encoded voice signal 
from the receiving circuit 419. 

15 

The base station 402 further comprises, typically, a base station 
controller 421, along with its associated database 422, for controlling 
communication between the control terminal 405 and the transmitter 414 and 
receiver 41 8. 

20 

As well known to those of ordinary skill in the art, voice encoding is 
required in order to reduce the bandwidth necessary to transmit sound 
signal, for example voice signal such as speech, across the bidirectional 
wireless radio communication subsystem, i.e., between a radiotelephone 
25 403 and a base station 402. 

LP voice encoders (such as 415 and 407) typically operating at 13 
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kbits/second and below such as Code-Excited Linear Prediction (CELP) 
encoders typically use a LP synthesis filter to model the short-term spectral 
envelope of the voice signal. The LP information is transmitted, typically, 
every 10 or 20 ms to the decoder (such 420 and 412) and is extracted at the 
decoder end. 

5 

The novel techniques disclosed in the present specification may apply 
to different LP-based coding systems. However, a CELP-type coding 
system is used in the preferred embodiment for the purpose of presenting a 
non-limitative illustration of these techniques. In the same manner, such 
10 techniques can be used with sound signals other than voice and speech as 
well with other types of wideband signals. 

Figure 1 shows a general block diagram of a CELP-type speech 
encoding device 100 modified to better accommodate wideband signals. 

The sampled input speech signal 1 14 is divided into successive L- 
sample blocks called "frames". In each frame, different parameters 
representing the speech signal in the frame are computed, encoded, and 
transmitted. LP parameters representing the LP synthesis filter are usually 

20 computed once every frame. The frame is further divided into smaller blocks 
of N samples (blocks of length A/), in which excitation parameters (pitch and 
innovation) are determined. In the CELP literature, these blocks of length N 
are called "subframes" and the A/-sample signals in the subframes are 
referred to as A/-dimensional vectors. In this preferred embodiment, the 

25 length N corresponds to 5 ms while the length L corresponds to 20 ms, 
which means that a frame contains four subframes (A/=80 at the sampling 
rate of 16 kHz and 64 after down-sampling to 12.8 kHz). Various A/- 
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dimensional vectors occur in the encoding procedure. A list of the vectors 
which appear in Figures 1 and 2 as well as a list of transmitted parameters 
are given herein below: 

List of the main A/- dimensional vectors 

5 

s Wideband signal input speech vector (after down-sampling, pre- 
processing, and preemphasis); 
s w Weighted speech vector; 
s 0 Zero-input response of weighted synthesis filter; 
10 s p Down-sampled pre-processed signal; 

Oversampled synthesized speech signal; 

s' Synthesis signal before deemphasis; 

s d Deemphasized synthesis signal; 
15 s h Synthesis signal after deemphasis and postprocessing; 

x Target vector for pitch search; 

x 9 Target vector for innovation search; 

h Weighted synthesis filter impulse response; 

v T Adaptive (pitch) codebook vector at delay 7; 
20 y r Filtered pitch codebook vector (v r convolved with h); 

c k Innovative codevector at index k (/c-th entry from the innovation 
codebook); 

c f Enhanced scaled innovation codevector; 

u Excitation signal (scaled innovation and pitch codevectors); 
25 u' Enhanced excitation; 

z Band-pass noise sequence; 

w' White noise sequence; and 
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w Scaled noise sequence. 



List of transmitted parameters 



STP 



Short term prediction parameters (defining A(z))\ 

Pitch lag (or pitch codebook index); 

Pitch gain (or pitch codebook gain); 

Index of the low-pass filter used on the pitch codevector; 

Codevector index (innovation codebook entry); and 

Innovation codebook gain. 



5 



T 



b 



J 



k 



9 



10 



In this preferred embodiment, the STP parameters are transmitted 
once per frame and the rest of the parameters are transmitted four times per 
frame (every subframe). 



The sampled speech signal is encoded on a block by block basis by 
the encoding device 100 of Figure 1 which is broken down into eleven 
modules numbered from 1 01 to 1 1 1 . 



The input speech is processed into the above mentioned L-sample 
blocks called frames. 

Referring to Figure 1 , the sampled input speech signal 1 14 is down- 
25 sampled in a down-sampling module 101 . For example, the signal is down- 
sampled from 16 kHz down to 12.8 kHz, using techniques well known to 
those of ordinary skill in the art. Down-sampling down to another frequency 



15 
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can of course be envisaged. Down-sampling increases the coding 
efficiency, since a smaller frequency bandwidth is encoded. This also 
reduces the algorithmic complexity since the number of samples in a frame 
is decreased. The use of down-sampling becomes significant when the bit 
rate is reduced below 16 kbit/s, although down-sampling is not essential 
5 above 16 kbit/s. 

After down-sampling, the 320-sample frame of 20 ms is reduced to 
256-sample frame (down-sampling ratio of 4/5). 

1 0 The input frame is then supplied to the optional pre-processing block 

1 02. Pre-processing block 1 02 may consist of a high-pass filter with a 50 Hz 
cut-off frequency. High-pass filter 102 removes the unwanted sound 
components below 50 Hz. 

15 The down-sampled pre-processed signal is denoted by s p (n), n=0, 1, 

2, ...,Z--1, where L is the length of the frame (256 at a sampling frequency of 
12.8 kHz). In a preferred embodiment of the preemphasis filter 103, the 
signal s p (n) is preemphasized using a filter having the following transfer 
function: 

20 

P(z) = 1 - yz" 1 

25 

where /z is a preemphasis factor with a value located between 0 and 1 (a 
typical value is ^ = 0.7). A higher-order filter could also be used. It should 
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be pointed out that high-pass filter 102 and preemphasis filter 103 can be 
interchanged to obtain more efficient fixed-point implementations. 

The function of the preemphasis filter 103 is to enhance the high 
frequency contents of the input signal. It also reduces the dynamic range of 
5 the input speech signal, which renders it more suitable for fixed-point 
implementation. Without preemphasis, LP analysis in fixed-point using 
single-precision arithmetic is difficult to implement. 

Preemphasis also plays an important role in achieving a proper 
10 overall perceptual weighting of the quantization error, which contributes to 
improved sound quality. This will be explained in more detail herein below. 

The output of the preemphasis filter 103 is denoted s(n). This signal 
is used for performing LP analysis in calculator module 104. LP analysis is 

15 a technique well known to those of ordinary skill in the art. In this preferred 
embodiment, the autocorrelation approach is used. In the autocorrelation 
approach, the signal s(n) is first windowed using a Hamming window (having 
usually a length of the order of 30-40 ms). The autocorrelations are 
computed from the windowed signal, and Levinson-Durbin recursion is used 

20 to compute LP filter coefficients, a,-, where /=1,...,p, and where p is the LP 
order, which is typically 16 in wideband coding. The parameters a, are the 
coefficients of the transfer function of the LP filter, which is given by the 
following relation: 



25 



A(z) = 1+IXz- 1 
/=i 
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LP analysis is performed in calculator module 104, which also 
performs the quantization and interpolation of the LP filter coefficients. The 
LP filter coefficients are first transformed into another equivalent domain 
more suitable for quantization and interpolation purposes. The line spectral 
5 pair (LSP) and immitance spectral pair (ISP) domains are two domains in 
which quantization and interpolation can be efficiently performed. The 16 LP 
filter coefficients, a h can be quantized in the order of 30 to 50 bits using split 
or multi-stage quantization, or a combination thereof. The purpose of the 
interpolation is to enable updating the LP filter coefficients every subframe 
10 while transmitting them once every frame, which improves the encoder 
performance without increasing the bit rate. Quantization and interpolation 
of the LP filter coefficients is believed to be otherwise well known to those of 
ordinary skill in the art and, accordingly, will not be further described in the 
present specification. 

15 

The following paragraphs will describe the rest of the coding 
operations performed on a subframe basis. In the following description, the 
filter A(z) denotes the unquantized interpolated LP filter of the subframe, and 
the filter A(z) denotes the quantized interpolated LP filter of the subframe. 

20 

Perceptual Weighting: 

In analysis-by-synthesis encoders, the optimum pitch and innovation 
parameters are searched by minimizing the mean squared error between the 
25 input speech and synthesized speech in a perceptually weighted domain. 
This is equivalent to minimizing the error between the weighted input speech 
and weighted synthesis speech. 
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The weighted signal sJin) is computed in a perceptual weighting filter 
1 05. Traditionally, the weighted signal sjin) is computed by a weighting filter 
having a transfer function W{z) in the form: 

5 W(z)=A(z/y t ) / A(z/y 2 ) where 0 <y 2 <Y^1 

As well known to those of ordinary skill in the art, in prior art analysis-by- 
synthesis (AbS) encoders, analysis shows that the quantization error is 

10 weighted by a transfer function W'\z), which is the inverse of the transfer 
function of the perceptual weighting filter 105. This result is well described 
by B.S. Atal and M.R. Schroeder in "Predictive coding of speech and 
subjective error criteria", IEEE Transaction ASSP, vol. 27, no. 3, pp. 247- 
254, June 1979. Transfer function W'\z) exhibits some of the formant 

15 structure of the input speech signal. Thus, the masking property of the 
human ear is exploited by shaping the quantization error so that it has more 
energy in the formant regions where it will be masked by the strong signal 
energy present in these regions. The amount of weighting is controlled by 
the factors y i and y 2 - 

20 

The above traditional perceptual weighting filter 105 works well with 
telephone band signals. However, it was found that this traditional 
perceptual weighting filter 105 is not suitable for efficient perceptual 
weighting of wideband signals. It was also found that the traditional 
25 perceptual weighting filter 105 has inherent limitations in modelling the 
formant structure and the required spectral tilt concurrently. The spectral tilt 
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is more pronounced in wideband signals due to the wide dynamic range 
between low and high frequencies. The prior art has suggested to add a tilt 
filter into W(z) in order to control the tilt and formant weighting of the 
wideband input signal separately. 

5 A novel solution to this problem is, in accordance with the present 

invention, to introduce the preemphasis filter 103 at the input, compute the 
LP filter A(z) based on the preemphasized speech s(n), and use a modified 
filter W{z) by fixing its denominator. 

10 LP analysis is performed in module 1 04 on the preemphasized signal 

s(ri) to obtain the LP filter A{z). Also, a new perceptual weighting filter 105 
with fixed denominator is used. An example of transfer function for the 
perceptual weighting filter 104 is given by the following relation: 

15 

W(z) = A (z/y,) / (l-y^ -1 ) where 0<Y 2 <Y^i 

A higher order can be used at the denominator. This structure substantially 
20 decouples the formant weighting from the tilt. 

Note that because A(z) is computed based on the preemphasized 
speech signal s(n), the tilt of the filter 1/A(z/yj) is less pronounced compared 

to the case when A(z) is computed based on the original speech. Since 
25 deemphasis is performed at the decoder end using a filter having the transfer 
function: 



WO 00/25305 



PCT/CA99/00990 



23 



P-\z) = \/(1-ixz 

the quantization error spectrum is shaped by a filter having a transfer 

5 function W~\z)P'\z). When y 2 is set equal to /v, which is typically the case, 

the spectrum of the quantization error is shaped by a filter whose transfer 

function is 1/A(z/Yj), with A(z) computed based on the preemphasized 

speech signal. Subjective listening showed that this structure for achieving 
the error shaping by a combination of preemphasis and modified weighting 
10 filtering is very efficient for encoding wideband signals, in addition to the 
advantages of ease of fixed-point algorithmic implementation. 

Pitch Analysis: 

15 

In order to simplify the pitch analysis, an open-loop pitch lag T OL is 
first estimated in the open-loop pitch search module 1 06 using the weighted 
speech signal sjn). Then the closed-loop pitch analysis, which is performed 
in closed-loop pitch search module 107 on a subframe basis, is restricted 
20 around the open-loop pitch lag T OL which significantly reduces the search 
complexity of the LTP parameters 7 and b (pitch lag and pitch gain). Open- 
loop pitch analysis is usually performed in module 106 once every 10 ms 
(two subframes) using techniques well known to those of ordinary skill in the 
art. 

25 
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The target vector x for LTP (Long Term Prediction) analysis is first 
computed. This is usually done by subtracting the zero-input response s 0 of 
weighted synthesis filter W(z)//A(z) from the weighted speech signal s w (n). 
This zero-input response s 0 is calculated by a zero-input response calculator 
108. More specifically, the target vector x is calculated using the following 
5 relation: 



x = s w - s 0 

10 

where x is the AWimensional target vector, s w is the weighted speech 
vector in the subframe, and s 0 is the zero-input response of filter W(z)/A(z) 
which is the output of the combined filter W(z)/A(z) due to its initial states. 
The zero-input response calculator 108 is responsive to the quantized 

1 5 interpolated LP filter A(z) from the LP analysis, quantization and interpolation 
calculator 104 and to the initial states of the weighted synthesis filter 
W(z)/A(z) stored in memory module 1 1 1 to calculate the zero-input response 
s 0 (that part of the response due to the initial states as determined by setting 
the inputs equal to zero) of filter W(z)/A(z). This operation is well known to 

20 those of ordinary skill in the art and, accordingly, will not be further described. 

Of course, alternative but mathematically equivalent approaches can 
be used to compute the target vector x. 

25 A A/-dimensional impulse response vector h of the weighted 

synthesis filter W(z)/A(z) is computed in the impulse response generator 109 
using the LP filter coefficients A(z) and A(z) from module 104. Again, this 



WO 00/25305 



PCT/CA99/00990 



25 

operation is well known to those of ordinary skill in the art and, accordingly, 
will not be further described in the present specification. 

The closed-loop pitch (or pitch codebook) parameters b, 7 and J are 
computed in the closed-loop pitch search module 107, which uses the target 
5 vector x, the impulse response vector h and the open-loop pitch lag 7" OL as 
inputs. Traditionally, the pitch prediction has been represented by a pitch 
filter having the following transfer function: 

10 1 / (1-6z r ) 

where b is the pitch gain and T is the pitch delay or lag. In this case, the 
pitch contribution to the excitation signal u(n) is given by bu{n-T), where the 
1 5 total excitation is given by 

u(n) = bu(n-T)+gc k (n) 

20 

with g being the innovative codebook gain and c^n) the innovative 
codevector at index k. 

This representation has limitations if the pitch lag Tis shorter than the 
25 subframe length N. In another representation, the pitch contribution can be 
seen as a pitch codebook containing the past excitation signal. Generally, 
each vector in the pitch codebook is a shift-by-one version of the previous 
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vector (discarding one sample and adding a new sample). For pitch lags 
T>N, the pitch codebook is equivalent to the filter structure (1/(1 -br r ) , and 
a pitch codebook vector v^ri) at pitch lag T is given by 

5 v T (n) = u (n-T) , n=0 AM. 

For pitch lags T shorter than A/, a vector v^ri) is built by repeating the 
available samples from the past excitation until the vector is completed (this 
10 is not equivalent to the filter structure). 

In recent encoders, a higher pitch resolution is used which 
significantly improves the quality of voiced sound segments. This is 
achieved by oversampling the past excitation signal using polyphase 
1 5 interpolation filters. In this case, the vector Vj(n) usually corresponds to an 
interpolated version of the past excitation, with pitch lag 7 being a non- 
integer delay (e.g. 50.25). 

The pitch search consists of finding the best pitch lag T and gain b 
20 that minimize the mean squared weighted error E between the target vector 
x and the scaled filtered past excitation. Error E being expressed as: 

E=\\x-by r \\ 2 

25 

where y T is the filtered pitch codebook vector at pitch lag T: 
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y T (n) = v T (n) * h(n) = J> r (i)h(n-i) . n=0,... l AM. 



5 It can be shown that the error E is minimized by maximizing the search 
criterion 



c- x ' yr 
fy*7> 



where t denotes vector transpose. 

15 

In the preferred embodiment of the present invention, a 1/3 
subsample pitch resolution is used, and the pitch (pitch codebook) search is 
composed of three stages. 

20 In the first stage, an open-loop pitch lag T OL is estimated in open-loop 

pitch search module 106 in response to the weighted speech signal sjn). 
As indicated in the foregoing description, this open-loop pitch analysis is 
usually performed once every 10 ms (two subfirames) using techniques well 
known to those of ordinary skill in the art. 

25 



In the second stage, the search criterion C is searched in the closed- 
loop pitch search module 107 for integer pitch lags around the estimated 
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open-loop pitch lag T OL (usually ±5), which significantly simplifies the search 
procedure. A simple procedure is used for updating the filtered codevector 
y T without the need to compute the convolution for every pitch lag. 

Once an optimum integer pitch lag is found in the second stage, a 
5 third stage of the search (module 107) tests the fractions around that 
optimum integer pitch lag. 

When the pitch predictor is represented by a filter of the form 
1/(1 -br T ), which is a valid assumption for pitch lags T>N, the spectrum of the 

1 0 pitch filter exhibits a harmonic structure over the entire frequency range, with 
a harmonic frequency related to 1/7. In case of wideband signals, this 
structure is not very efficient since the harmonic structure in wideband 
signals does not cover the entire extended spectrum. The harmonic 
structure exists only up to a certain frequency, depending on the speech 

1 5 segment. Thus, in order to achieve efficient representation of the pitch 
contribution in voiced segments of wideband speech, the pitch prediction 
filter needs to have the flexibility of varying the amount of periodicity over the 
wideband spectrum. 

20 A new method which achieves efficient modeling of the harmonic 

structure of the speech spectrum of wideband signals is disclosed in the 
present specification, whereby several forms of low pass filters are applied 
to the past excitation and the low pass filter with higher prediction gain is 
selected. 

25 

When subsample pitch resolution is used, the low pass filters can be 
incorporated into the interpolation filters used to obtain the higher pitch 
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resolution. In this case, the third stage of the pitch search, in which the 
fractions around the chosen integer pitch lag are tested, is repeated for the 
several interpolation filters having different low-pass characteristics and the 
fraction and filter index which maximize the search criterion C are selected. 

5 A simpler approach is to complete the search in the three stages 

described above to determine the optimum fractional pitch lag using only one 
interpolation filter with a certain frequency response, and select the optimum 
low-pass filter shape at the end by applying the different predetermined low- 
pass filters to the chosen pitch codebook vector v T and select the low-pass 
1 0 filter which minimizes the pitch prediction error. This approach is discussed 
in detail below. 

Figure 3 illustrates a schematic block diagram of a preferred 
embodiment of the proposed approach. 

15 

In memory module 303, the past excitation signal n<0, is stored. 
The pitch codebook search module 301 is responsive to the target vector x, 
to the open-loop pitch lag T OL and to the past excitation signal u(n), n<0, from 
memory module 303 to conduct a pitch codebook (pitch codebook) search 

20 minimizing the above-defined search criterion C. From the result of the 
search conducted in module 301, module 302 generates the optimum pitch 
codebook vector v T . Note that since a sub-sample pitch resolution is used 
(fractional pitch), the past excitation signal u(n), n<0 t is interpolated and the 
pitch codebook vector v r corresponds to the interpolated past excitation 

25 signal. In this preferred embodiment, the interpolation filter (in module 301 , 
but not shown) has a low-pass filter characteristic removing the frequency 
contents above 7000 Hz. 
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In a preferred embodiment, K filter characteristics are used; these 
filter characteristics could be low-pass or band-pass filter characteristics. 
Once the optimum codevector v 7 is determined and supplied by the pitch 
codevector generator 302, K filtered versions of v r are computed 
respectively using K different frequency shaping filters such as 3O5 0) , where 
5 j=1, 2, ... , K. These filtered versions are denoted v f a) , where y=7, 2, ... , K. 
The different vectors vf } are convolved in respective modules 304 G) , where 
j=0, 1, 2, ... , K, with the impulse response h to obtain the vectors y®, where 
y=0, 1, 2, ... , K. To calculate the mean squared pitch prediction error for 
each vector y°\ the value y % multiplied by the gain b by means of a 
1 0 corresponding amplifier 307® and the value by° } is subtracted from the target 
vector x by means of a corresponding subtracter 308®. Selector 309 selects 
the frequency shaping filter 3O5 0) which minimizes the mean squared pitch 
prediction error 

15 

e V) = H x - b (/) y V)f , y=1,2 K 

To calculate the mean squared pitch prediction error e® for each value of 
20 the value y 69 is multiplied by the gain b by means of a corresponding 
amplifier 307® and the value b^y 6 * is subtracted from the target vector x by 
means of subtracters 308®. Each gain b® is calculated in a corresponging 
gain calculator 306® in association with the frequency shaping filter at index 
j, using the following relationship: 

25 



b«= x V U) /||y (/) ll 2 
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In selector 309, the parameters b, T, and j are chosen based on v T or 
v,® which minimizes the mean squared pitch prediction error e. 

5 

Referring back to Figure 1, the pitch codebook index T is encoded 
and transmitted to multiplexer 112. The pitch gain b is quantized and 
transmitted to multiplexer 112. With this new approach, extra information is 
needed to encode the index j of the selected frequency shaping filter in 
10 multiplexer 112. For example, if three filters are used Q=0, 1, 2, 3), then two 
bits are needed to represent this information. The filter index information j 
can also be encoded jointly with the pitch gain b. 

1 5 Innovative codebook search: 

Once the pitch, or LTP (Long Term Prediction) parameters b, 7, and 
j are determined, the next step is to search for the optimum innovative 
excitation by means of search module 110 of Figure 1. First, the target 
20 vector x is updated by subtracting the LTP contribution: 

x'=x-/>y 7 

25 

where b is the pitch gain and y r is the filtered pitch codebook vector (the 
past excitation at delay T filtered with the selected low pass filter and 
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convolved with the inpulse response h as described with reference to Figure 
3). 

The search procedure in CELP is performed by finding the optimum 
excitation codevector c k and gain g which minimize the mean-squared error 
5 between the target vector and the scaled filtered codevector 

E = || x - gHc k || 2 

10 

where H is a lower triangular convolution matrix derived from the impulse 
response vector h. 

In the preferred embodiment of the present invention, the innovative 
15 codebook search is performed in module 110 by means of an algebraic 
codebook as described in US patents Nos: 5,444,816 (Adoul et al.) issued 
on August 22, 1995; 5,699,482 granted to Adoul et al., on December 17, 
1997; 5,754,976 granted to Adoul etal., on May 19, 1998; and 5,701,392 
(Adoul et al.) dated December 23, 1997. 

20 

Once the optimum excitation codevector c k and its gain g are chosen 
by module 110, the codebook index k and gain g are encoded and 
transmitted to multiplexer 112. 

25 Referring to Figure 1, the parameters b, T, j, A(z), k and g are 

multiplexed through the multiplexer 112 before being transmitted through a 
communication channel. 
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Memory update: 

In memory module 111 (Figure 1), the states of the weighted 
synthesis filter W(z)/C4(z) are updated by filtering the excitation signal 
5 u = gc k + bv T through the weighted synthesis filter. After this filtering, the 
states of the filter are memorized and used in the next subframe as initial 
states for computing the zero-input response in calculator module 108. 

As in the case of the target vector x, other alternative but 
1 0 mathematically equivalent approaches well known to those of ordinary skill 
in the art can be used to update the filter states. 

PECOPERSJDE 

15 

The speech decoding device 200 of Figure 2 illustrates the various 
steps earned out between the digital input 222 (input stream to the 
demultiplexer 217) and the output sampled speech 223 (output of the adder 
221). 

20 

Demultiplexer 217 extracts the synthesis model parameters from the 
binary information received from a digital input channel. From each received 
binary frame, the extracted parameters are: 



25 



- the short-term prediction parameters (STP) A(z) (once per frame); 
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- the long-term prediction (LTP) parameters 7, b, and j (for each 
subframe); and 

- the innovation codebook index k and gain g (for each subframe). 

5 The current speech signal is synthesized based on these parameters as will 
be explained hereinbelow. 

The innovative codebook 218 is responsive to the index kio produce 
the innovation codevector c*, which is scaled by the decoded gain factor g 
10 through an amplifier 224. In the preferred embodiment, an innovative 
codebook 218 as described in the above mentioned US patent numbers 
5,444,816; 5,699,482; 5,754,976; and 5,701,392 is used to represent the 
innovative codevector c* . 

1 5 The generated scaled codevector gc k at the output of the amplifier 

224 is processed through a innovation filter 205. 

Periodicity enhancement: 

20 The generated scaled codevector at the output of the amplifier 224 

is processed through a frequency-dependent pitch enhancer 205. 

Enhancing the periodicity of the excitation signal u improves the 
quality in case of voiced segments. This was done in the past by filtering 
25 the innovation vector from the innovative codebook (fixed codebook) 218 
through a filter in the form 1/(1-ebr r ) where e is a factor below 0.5 which 
controls the amount of introduced periodicity. This approach is less 
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efficient in case of wideband signals since it introduces periodicity over 
the entire spectrum. A new alternative approach, which is part of the 
present invention, is disclosed whereby periodicity enhancement is 
achieved by filtering the innovative codevector c k from the innovative 
(fixed) codebook through an innovation filter 205 (F(z)) whose frequency 
5 response emphasizes the higher frequencies more than lower 
frequencies. The coefficients of F(z) are related to the amount of 
periodicity in the excitation signal u. 

Many methods known to those skilled in the art are available for 
1 0 obtaining valid periodicity coefficients. For example, the value of gain b 
provides an indication of periodicity. That is, if gain b is close to 1, the 
periodicity of the excitation signal u is high, and if gain b is less than 0.5, 
then periodicity is low. 

1 5 Another efficient way to derive the filter F(z) coefficients used in a 

preferred embodiment, is to relate them to the amount of pitch 
contribution in the total excitation signal u. This results in a frequency 
response depending on the subframe periodicity, where higher 
frequencies are more strongly emphasized (stronger overall slope) for 

20 higher pitch gains. Innovation filter 205 has the effect of lowering the 
energy of the innovative codevector c„ at low frequencies when the 
excitation signal u is more periodic, which enhances the periodicity of the 
excitation signal u at lower frequencies more than higher frequencies. 
Suggested forms for innovation filter 205 are 

25 



(1) F(z)=1-oz-\ 



(2) F(z) = -az+1-az " 1 
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or 

where o or a are periodicity factors derived from the level of periodicity 
of the excitation signal u. 

5 The second three-term form of F(z) is used in a preferred 

embodiment. The periodicity factor a is computed in the voicing factor 
generator 204. Several methods can be used to derive the periodicity 
factor a based on the periodicity of the excitation signal u. Two methods 
are presented below. 

10 

Method 1 : 

The ratio of pitch contribution to the total excitation signal u is first 
computed in voicing factor generator 204 by 

15 



20 



I" 2 (") 

l»=0 



where v T is the pitch codebook vector, b is the pitch gain, and u is the 
excitation signal u given at the output of the adder 21 9 by 

25 u - gc k + bv T 



Note that the term bv T has its source in the pitch codebook (pitch 
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codebook) 201 in response to the pitch lag T and the past value of u 
stored in memory 203. The pitch codevector v r from the pitch codebook 
201 is then processed through a low-pass filter 202 whose cut-off 
frequency is adjusted by means of the index j from the demultiplexer 217. 
The resulting codevector v T is then multiplied by the gain b from the 
5 demultiplexer 217 through an amplifier 226 to obtain the signal bv T . 

The factor a is calculated in voicing factor generator 204 by 

a = qR p bounded by a < q 

10 

where q is a factor which controls the amount of enhancement {q is set 
to 0.25 in this preferred embodiment). 

Method 2: 

15 

Another method used in a preferred embodiment of the invention 
for calculating periodicity factor a is discussed below. 

First, a voicing factor r v is computed in voicing factor generator 204 

20 by 

r v = (E v - E c ) I (E v + E c ) 

25 where E v is the energy of the scaled pitch codevector bv T and E c is the 
energy of the scaled innovative codevector gc k . That is 
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E v = b 2 v T * v T = b 2 "zv T 2 (n) 
and 

n=0 

Note that the value of r v lies between -1 and 1 (1 corresponds to 
purely voiced signals and -1 corresponds to purely unvoiced signals). 

10 In this preferred embodiment, the factor a is then computed in 

voicing factor generator 204 by 

a =0.125 (1 +r v ) 

15 

which corresponds to a value of 0 for purely unvoiced signals and 0.25 
for purely voiced signals. 

20 In the first, two-term form of F(z), the periodicity factor a can be 

approximated by using a = 2a in methods 1 and 2 above. In such a 
case, the periodicity factor o is calculated as follows in method 1 above: 
a = 2qR p bounded by a < 2q. 



25 In method 2, the periodicity factor o is calculated as follows: 

o = 0.25 (1 + r v ). 
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The enhanced signal c, is therefore computed by filtering the 
scaled innovative codevector gc k through the innovation filter 205 (F(z)). 

The enhanced excitation signal u' is computed by the adder 220 

as: 

5 

u' = c f +bv T 

1 0 Note that this process is not performed at the encoder 100. Thus, 

it is essential to update the content of the pitch codebook 201 using the 
excitation signal u without enhancement to keep synchronism between 
the encoder 100 and decoder 200. Therefore, the excitation signal u is 
used to update the memory 203 of the pitch codebook 201 and the 

1 5 enhanced excitation signal u' is used at the input of the LP synthesis filter 
206. 

Synthesis and deemphasis 

20 

The synthesized signal s' is computed by filtering the enhanced 
excitation signal u' through the LP synthesis filter 206 which has the form 
MA(z), where A(z) is the interpolated LP filter in the current subframe. As 
can be seen in Figure 2, the quantized LP coefficients A(z) on line 225 from 
25 demultiplexer 217 are supplied to the LP synthesis filter 206 to adjust the 
parameters of the LP synthesis filter 206 accordingly. The deemphasis filter 
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207 is the inverse of the preemphasis filter 103 of Figure 1. The transfer 
function of the deemphasis filter 207 is given by 

D (z) = 1 / (1 -\xz 1 ) 

5 

where /x is a preemphasis factor with a value located between 0 and 1 (a 
typical value is n = 0.7). A higher-order filter could also be used. 

10 The vector s' is filtered through the deemphasis filter D(z) (module 

207) to obtain the vector which is passed through the high-pass filter 208 
to remove the unwanted frequencies below 50 Hz and further obtain s h . 

15 Oversampling and high-frequency regeneration 

The over-sampling module 209 conducts the inverse process of the 
down-sampling module 101 of Figure 1. In this preferred embodiment, 
oversampling converts from the 12.8 kHz sampling rate to the original 16 kHz 
20 sampling rate, using techniques well known to those of ordinary skill in the 
art. The oversampled synthesis signal is denoted S. Signal S is also 
referred to as the synthesized wideband intermediate signal. 

The oversampled synthesis § signal does not contain the higher 
25 frequency components which were lost by the downsampling process 
(module 101 of Figure 1) at the encoder 100. This gives a low-pass 
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perception to the synthesized speech signal. To restore the full band of the 
original signal, a high frequency generation procedure is disclosed. This 
procedure is performed in modules 210 to 216, and adder 221, and requires 
input from voicing factor generator 204 (Figure 2). 

5 In this new approach, the high frequency contents are generated by 

filling the upper part of the spectrum with a white noise properly scaled in the 
excitation domain, then converted to the speech domain, preferably by 
shaping it with the same LP synthesis filter used for synthesizing the down- 
sampled signal § . 

10 

The high frequency generation procedure in accordance with the 
present invention is described hereinbelow. 

The random noise generator 213 generates a white noise sequence 
15 iv' with a flat spectrum over the entire frequency bandwidth, using 
techniques well known to those of ordinary skill in the art. The generated 
sequence is of length N' which is the subframe length in the original domain. 
Note that N is the subframe length in the down-sampled domain. In this 
preferred embodiment, A/=64 and A/-80 which correspond to 5 ms. 

20 

The white noise sequence is properly scaled in the gain adjusting 
module 214. Gain adjustment comprises the following steps. First, the 
energy of the generated noise sequence w 9 is set equal to the energy of the 
enhanced excitation signal u' computed by an energy computing module 
25 210, and the resulting scaled noise sequence is given by 
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E«' 2 e» 



w(n) = W(n) 




/?=0,...,/v"-1. 



E "' 2 e» 



5 



The second step in the gain scaling is to take into account the high 
frequency contents of the synthesized signal at the output of the voicing 
factor generator 204 so as to reduce the energy of the generated noise in 
case of voiced segments (where less energy is present at high frequencies 

1 0 compared to unvoiced segments). In this preferred embodiment, measuring 
the high frequency contents is implemented by measuring the tilt of the 
synthesis signal through a spectral tilt calculator 212 and reducing the 
energy accordingly. Other measurements such as zero crossing 
measurements can equally be used. When the tilt is very strong, which 

1 5 corresponds to voiced segments, the noise energy is further reduced. The 
tilt factor is computed in module 212 as the first correlation coefficient of the 
synthesis signal s„ and it is given by: 



20 



E Si, (") s h ( n_1 > 



conditioned by tilt :> 0 and tilt * r r 



tilt = 



E V W 



25 



where voicing factor r v is given by 
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5 where E v is the energy of the scaled pitch codevectorbv r and E c is the 
energy of the scaled innovative codevector gc h as described earlier. Voicing 
factor r v is most often less than tilt but this condition was introduced as a 
precaution against high frequency tones where the tilt value is negative and 
the value of r v is high. Therefore, this condition reduces the noise energy for 

10 such tonal signals. 

The tilt value is 0 in case of flat spectrum and 1 in case of strongly 
voiced signals, and it is negative in case of unvoiced signals where more 
energy is present at high frequencies. 

15 

Different methods can be used to derive the scaling factor g, from the 
amount of high frequency contents. In this invention, two methods are given 
based on the tilt of signal described above. 

20 Method 1: 

The scaling factor g, is derived from the tilt by 



25 



g t = 1 - tilt bounded by 0.2 s g t <; 1 .0 
For strongly voiced signal where the tilt approaches 1 , g t is 0.2 and for 
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strongly unvoiced signals g t becomes 1.0. 
Method 2: 

The tilt factor g t is first restricted to be larger or equal to zero, then the 
5 scaling factor is derived from the tilt by 

g f =10-°- 6W ' 

10 

The scaled noise sequence w g produced in gain adjusting module 214 
is therefore given by: 

Wg = 9t * 

15 

When the tilt is close to zero, the scaling factor g t is close to 1 , which 
does not result in energy reduction. When the tilt value is 1, the scaling 
factor g t results in a reduction of 12 dB in the energy of the generated noise. 

20 Once the noise is properly scaled (w g ), it is brought into the speech 

domain using the spectral shaper 215. In the preferred embodiment, this is 
achieved by filtering the noise w g through a bandwidth expanded version of 
the same LP synthesis filter used in the down-sampled domain (1/A(z/0.8)). 
The corresponding bandwidth expanded LP filter coefficients are calculated 

25 in spectral shaper 215. 
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The filtered scaled noise sequence w f is then band-pass filtered to the 
required frequency range to be restored using the band-pass filter 216. In 
the preferred embodiment, the band-pass filter 216 restricts the noise 
sequence to the frequency range 5.6-7.2 kHz. The resulting band-pass 
filtered noise sequence z is added in adder 221 to the oversampled 
5 synthesized speech signal s to obtain the final reconstructed sound signal 
s out on the output 223. 

Although the present invention has been described hereinabove by 
way of a preferred embodiment thereof, this embodiment can be modified at 

10 will, within the scope of the appended claims, without departing from the 
spirit and nature of the subject invention. Even though the preferred 
embodiment discusses the use of wideband speech signals, it will be 
obvious to those skilled in the art that the subject invention is also directed 
to other embodiments using wideband signals in general and that it is not 

1 5 necessarily limited to speech applications. 
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WHAT IS CLAIMED IS: 

1 . A device for recovering a high frequency content of a wideband 
signal previously down-sampled and for injecting said high frequency content 
in an over-sampled synthesized version of said wideband signal to produce 
5 a full-spectrum synthesized wideband signal, said high-frequency content 
recovering device comprising: 

a) a noise generator for producing a noise sequence; 

b) a spectral shaping unit for shaping said noise sequence in 
relation to shaping parameters representative of said down-sampled 

1 0 wideband signal; and 

c) a signal injection circuit for injecting said spectrally-shaped 
noise sequence in said over-sampled synthesized signal version to thereby 
produce said full-spectrum synthesized wideband signal. 

15 2. A high-frequency content recovering device as defined in claim 1, 
wherein said noise generator comprises a random noise generator for 
producing a white noise sequence whereby said spectral shaping unit 
produces a spectrally-shaped white noise sequence. 

20 3. A high-frequency content recovering device as defined in claim 2, 
wherein said spectral shaping unit further comprises: 

a) a gain adjustment module, responsive to said white noise 
sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 
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b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 
bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 

5 version; and 

c) a band-pass filter responsive to said filtered scaled white noise 
sequence for producing a band-pass filtered scaled white noise sequence 
to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 

10 

4. A method for recovering a high frequency content of a wideband signal 
previously down-sampled and for injecting said high frequency content in an 
over-sampled synthesized version of said wideband signal to produce a full- 
spectrum synthesized wideband signal, said high-frequency content 
1 5 recovering method comprising: 

a) generating a noise sequence; 

b) spectrally-shaping said noise sequence in relation to shaping 
parameters representative of said down-sampled wideband signal; and 

c) injecting said spectrally-shaped noise sequence in said over- 
20 sampled synthesized signal version to thereby produce said full-spectrum 

synthesized wideband signal. 

5. A high-frequency content recovering method as defined in claim 4, 
wherein generating said noise sequence comprises producing a white noise 
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sequence whereby said spectral-shaping of the noise sequence produces 
a spectrally-shaped white noise sequence. 

6. A high-frequency content recovering method as defined in claim 5, 
wherein said spectral shaping of the noise sequence further comprises: 
5 a) producing a scaled white noise sequence in response to said 

white noise sequence and a first subset of said shaping parameters; 

b) filtering said scaled white noise sequence in relation to a 
second subset of said shaping parameters comprising bandwidth expanded 
synthesis filter coefficients to produce a filtered scaled white noise sequence 

1 0 characterized by a frequency bandwidth generally higher than a frequency 
bandwidth of said over-sampled synthesized signal version; and 

c) band-pass filtering said filtered scaled white noise sequence to 
produce a band-pass filtered scaled white noise sequence to be 
subsequently injected in said over-sampled synthesized signal version as 

1 5 said spectrally-shaped white noise sequence. 

7. A decoder for producing a synthesized wideband signal, comprising: 

a) a signal fragmenting device for receiving an encoded version 
of a wideband signal previously down-sampled during encoding and 

20 extracting from said encoded wideband signal version at least pitch 
codebook parameters, innovative codebook parameters, and synthesis filter 
coefficients; 

b) a pitch codebook responsive to said pitch codebook 
parameters for producing a pitch codevector; 
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c) an innovative codebook responsive to said innovative 
codebook parameters for producing an innovative codevector; 

d) a combiner circuit for combining said pitch codevector and said 
innovative codevector to thereby produce an excitation signal; 

e) a signal synthesis device including a synthesis filter for filtering 
said excitation signal in relation to said synthesis filter coefficients to thereby 
produce a synthesized wideband signal, and an oversampler responsive to 
said synthesized wideband signal for producing an over-sampled signal 
version of the synthesized wideband signal; and 

f) a high-frequency content recovering device as recited in claim 
1 for recovering a high frequency content of said wideband signal and for 
injecting said high frequency content in said over-sampled signal version to 
produce the full-spectrum synthesized wideband signal. 

8. A decoder for producing a synthesized wideband signal as defined in 
1 5 claim 7, wherein said noise generator comprises a random noise generator 

for producing a white noise sequence whereby said spectral shaping unit 
produces a spectrally-shaped white noise sequence. 

9. A decoder for producing a synthesized wideband signal as defined in 
20 claim 8, wherein said spectral shaping unit further comprises: 

a) a gain adjustment module, responsive to said white noise 
sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 



5 



10 
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b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 
bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 

5 version; and 

c) a band-pass filter responsive to said filtered scaled white noise 
sequence for producing a band-pass filtered scaled white noise sequence 
to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 

10 

10. A decoder for producing a synthesized wideband signal as defined in 
claim 9, further comprising: 

a) a voicing factor generator responsive to said adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to said 

1 5 gain adjustment module; 

b) an energy computing module responsive to said excitation 
signal for calculating an excitation energy for forwarding to said gain 
adjustment module; and 

c) a spectral tilt calculator responsive to said synthesized signal 
20 for calculating a tilt scaling factor for forwarding to said gain adjustment 

module; 

wherein said first subset of said shaping parameters comprises said voicing 
factor, said energy scaling factor, and said tilt scaling factor, and wherein 
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said second subset of said shaping parameters includes linear prediction 
coefficients. 

1 1. A decoder for producing a synthesized wideband signal as defined in 
claim 10, wherein said voicing factor generator comprises a means for 
calculating said voicing factor r v using the relation: 

r v = (E v - E c ) I (E v + E c ) 

where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 

12. A decoder for producing a synthesized wideband signal as defined in 
claim 10, wherein said gain adjusting unit comprises a means for calculating 
an energy scaling factor using the relation: 



E"' 2 <") 



Energy scaling factor = 




where is said white noise sequence and u' is an enhanced excitation 
signal derived from said excitation signal. 
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13. A decoder for producing a synthesized wideband signal as defined in 
claim 10, wherein said spectral tilt calculator comprises a means for 
calculating said tilt scaling factor g t using the relation: 

g t = 1 - tilt bounded by 0.2 <; g, <;1 .0 

where 



£ V (n) 



f//f = ji=i , conditioned by tilt z 0 and tilt z 



14. A decoder for producing a synthesized wideband signal as defined in 
claim 10, wherein said spectral tilt calculator comprising a means for 
calculating said tilt scaling factor g t using the relation: 

15 Sf" 10 bounded by 0.2 <.g t z 1.0 

where 



20 



A/-1 



W-1 

£ S/ 2 ( n j , conditioned by tilt > 0 and ff/r z r w 

n=0 
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15. A decoder for producing a synthesized wideband signal as defined in 
claim 9, wherein said band-pass filter comprises a frequency bandwidth 
located between 5.6 kHz and 7.2 kHz. 

5 

16. In a decoder for producing a synthesized wideband signal, comprising: 

a) a signal fragmenting device for receiving an encoded version 
of a wideband signal previously down-sampled during encoding and 
extracting from said encoded wideband signal version at least pitch 

1 0 codebook parameters, innovative codebook parameters, and synthesis filter 
coefficients; 

b) a pitch codebook responsive to said pitch codebook 
parameters for producing a pitch codevector; 

c) an innovative codebook responsive to said innovative 
1 5 codebook parameters for producing an innovative codevector; 

d) a combiner circuit for combining said pitch codevector and said 
innovative codevector to thereby produce an excitation signal; and 

e) a signal synthesis device including a synthesis filter for filtering 
said excitation signal in relation to said synthesis filter coefficients to thereby 

20 produce a synthesized wideband signal, and an oversampler responsive to 
said synthesized wideband signal for producing an over-sampled signal 
version of the synthesized wideband signal; 

the improvement comprising a high-frequency content recovering device as 
recited in claim 1 for recovering a high frequency content of said wideband 
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signal and for injecting said high frequency content in said over-sampled 
signal version to produce the full-spectrum synthesized wideband signal. 

17. A decoder for producing a synthesized wideband signal as defined in 
claim 16, wherein said noise generator comprises a random noise generator 

5 for producing a white noise sequence whereby said spectral shaping unit 
produces a spectrally-shaped white noise sequence. 

18. A decoder for producing a synthesized wideband signal as defined in 
claim 17, wherein said spectral shaping unit further comprises: 

10 a) a gain adjustment module, responsive to said white noise 

sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 

b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 

1 5 bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 
version; and 

c) a band-pass filter responsive to comprising the above 
20 described decoder said filtered scaled white noise sequence for producing 

a band-pass filtered scaled white noise sequence to be subsequently 
injected in said over-sampled synthesized signal version as said spectrally- 
shaped white noise sequence. 
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1 9. A decoder for producing a synthesized wideband signal as defined in 

claim 18, further comprising: 

a) a voicing factor generator responsive to said adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to said 

5 gain adjustment module; 

b) an energy computing module responsive to said excitation 
signal for calculating an excitation energy for forwarding to said gain 
adjustment module; and 

c) a spectral tilt calculator responsive to said synthesized signal 
10 for calculating a tilt scaling factor for forwarding to said gain adjustment 

module; 

wherein said first subset of said shaping parameters comprises said voicing 
factor, said energy scaling factor, and said tilt scaling factor, and wherein 
said second subset of said shaping parameters includes linear prediction 
1 5 coefficients. 

20. A decoder for producing a synthesized wideband signal as defined in 
claim 19, wherein said voicing factor generator comprises a means for 
calculating said voicing factor r v using the relation: 

20 

r v = (E v - £ c ) / (E v + E c ) 

where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 
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21. A decoder for producing a synthesized wideband signal as defined in 
claim 19, wherein said gain adjusting unit comprises a means for calculating 
an energy scaling factor using the relation: 



where vf is said white noise sequence and u' is an enhanced excitation 
signal derived from said excitation signal. 

22. A decoder for producing a synthesized wideband signal as defined in 
claim 19, wherein said spectral tilt calculator comprises a means for 
calculating said tilt scaling factor g t using the relation: 



Energy scaling factor = 




n=0 AM. 



g t =1-tilt 



bounded by 0.2 <; g, <;1.0 



where 



12 s h (") s h ( n_1 > 



, conditioned by tilt > 0 and tilt k 



tilt = 
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23. A decoder for producing a synthesized wideband signal as defined in 
claim 19, wherein said spectral tilt calculator comprising a means for 
calculating said tilt scaling factor g t using the relation: 

g r 10 bounded by 0.2 s g,*1.0 

where 



A/-1 

tut = — , conditioned by tilt * 0 and tilt z 

£ V (») 

n=0 



24. A decoder for producing a synthesized wideband signal as defined in 
claim 18, wherein said band-pass filter comprises a frequency bandwidth 
located between 5.6 kHz and 7.2 kHz. 

25. A cellular communication system for servicing a large geographical area 
divided into a plurality of cells, comprising: 

a) mobile transmitter/receiver units; 

b) cellular base stations respectively situated in said cells; 
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c) a control terminal for controlling communication between the 
cellular base stations; 

d) a bidirectional wireless communication sub-system between 
each mobile unit situated in one cell and the cellular base station of said one 
cell, said bidirectional wireless communication sub-system comprising, in 
both the mobile unit and the cellular base station: 

i) a transmitter including an encoder for encoding a 
wideband signal and a transmission circuit for transmitting the 
encoded wideband signal; and 

ii) a receiver including a receiving circuit for receiving a 
transmitted encoded wideband signal and a decoder as recited in 
claim 7 for decoding the received encoded wideband signal. 

26. A cellular communication system as defined in claim 25, wherein said 
noise generator comprises a random noise generator for producing a white 

15 noise sequence whereby said spectral shaping unit produces a spectrally- 
shaped white noise sequence. 

27. A cellular communication system as defined in claim 26, wherein said 
spectral shaping unit further comprises: 

20 a) a gain adjustment module, responsive to said white noise 

sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 

b) a spectral shaper for filtering said scaled white noise sequence in 
relation to a second subset of said shaping parameters comprising 



5 



10 
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bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 
version; and 

c) a band-pass filter responsive to said filtered scaled white noise 
5 sequence for producing a band-pass filtered scaled white noise sequence 
to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 

28. A cellular communication system as defined in claim 27, further 
1 0 comprising: 

a) a voicing factor generator responsive to said adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to said 
gain adjustment module; 

b) an energy computing module responsive to said excitation signal 
15 for calculating an excitation energy for forwarding to said gain adjustment 

module; and 

c) a spectral tilt calculator responsive to said synthesized signal for 
calculating a tilt scaling factor for forwarding to said gain adjustment module; 
wherein said first subset of said shaping parameters comprises said voicing 

20 factor, said energy scaling factor, and said tilt scaling factor, and wherein 
said second subset of said shaping parameters includes linear prediction 
coefficients. 
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29. A cellular communication system as defined in claim 28, wherein said 
voicing factor generator comprises a means for calculating said voicing 
factor r v using the relation: 

r v = (E v - E c ) I (E v + E c ) 

5 

where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 

30. A cellular communication system as defined in claim 28, wherein said 
10 gain adjusting unit comprises a means for calculating an energy scaling 

factor using the relation: 



A/-1 

, n=0 AM. 

n=0 

where w 1 is said white noise sequence and u' is an enhanced excitation 
signal derived from said excitation signal. 

20 

31. A cellular communication system as defined in claim 28, wherein said 
spectral tilt calculator comprises a means for calculating said tilt scaling 
factor g, using the relation: 

g, = 1 - tilt bounded by 0.2 s g, * 1 .0 



15 Energy scaling factor = 

N 
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where 



A/-1 



V s (n) s„ (n-1) 
///f = j7=i , conditioned by tilt > 0 and tilt ;> 

E *„ 2 (n) 



32. A cellular communication system as defined in claim 28, wherein said 
spectral tilt calculator comprising a means for calculating said tilt scaling 
factor g t using the relation: 

10 

g ' _1 u bounded by 0.2 <; g, *1 .0 

where 

N-1 

, conditioned by fr/f * 0 and tilt z r v 

W-1 

E V (n) 



15 

f//f = 



20 33. A cellular communication system as defined in claim 27, wherein said 
band-pass filter comprises a frequency bandwidth located between 5.6 kHz 
and 7.2 kHz. 



34. A cellular mobile transmitter/receiver unit comprising: 
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a) a transmitter including an encoder for encoding a wideband 
signal and a transmission circuit for transmitting the encoded wideband 
signal; and 

b) a receiver including a receiving circuit for receiving a 
transmitted encoded wideband signal and a decoder as recited in claim 7 for 

5 decoding the received encoded wideband signal. 

35. A cellular mobile transmitter/receiver unit as defined in claim 34, wherein 
said noise generator comprises a random noise generator for producing a 
white noise sequence whereby said spectral shaping unit produces a 

1 0 spectrally-shaped white noise sequence. 

36. A cellular mobile transmitter/receiver unit as defined in claim 35, wherein 
said spectral shaping unit further comprises: 

a) a gain adjustment module, responsive to said white noise 
15 sequence and a first subset of said shaping parameters, for producing a 

scaled white noise sequence; 

b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 
bandwidth expanded synthesis filter coefficients to produce a filtered scaled 

20 white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 
version; and 

c) a band-pass filter responsive to said filtered scaled white noise 
sequence for producing a band-pass filtered scaled white noise sequence 
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to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 

37. A cellular mobile transmitter/receiver unit as defined in claim 36, further 
comprising: 

5 a) a voicing factor generator responsive to said adaptive and 

innovative codevectors for calculating a voicing factor for forwarding to said 
gain adjustment module; 

b) an energy computing module responsive to said excitation 
signal for calculating an excitation energy for forwarding to said gain 

1 0 adjustment module; and 

c) a spectral tilt calculator responsive to said synthesized signal 
for calculating a tilt scaling factor for forwarding to said gain adjustment 
module; 

wherein said first subset of said shaping parameters comprises said voicing 
15 factor, said energy scaling factor, and said tilt scaling factor, and wherein 
said second subset of said shaping parameters includes linear prediction 
coefficients. 

38. A cellular mobile transmitter/receiver unit as defined in claim 37, wherein 
20 said voicing factor generator comprises a means for calculating said voicing 

factor r v using the relation: 



r v = (E v - E c ) I (E v + E c ) 
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where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 

39. A cellular mobile transmitter/receiver unit as defined in claim 37, wherein 
said gain adjusting unit comprises a means for calculating an energy scaling 
factor using the relation: 



where w 9 is said white noise sequence and u' is an enhanced excitation 
signal derived from said excitation signal. 

40. A cellular mobile transmitter/receiver unit as defined in claim 37, wherein 
said spectral tilt calculator comprises a means for calculating said tilt scaling 
factor g t using the relation: 



£"' 2 (") 



Energy scaling factor 




n=0,...,A/-1. 



g,= 1-ff/f 



bounded by 0.2 <; g, *1.0 



where 



£s„ (n) s h 



, conditioned by tilt k 0 and tilt * 



tilt = 
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41 . A cellular mobile transmitter/receiver unit as defined in claim 37, wherein 
said spectral tilt calculator comprising a means for calculating said tilt scaling 
factor g t using the relation: 

n -in 0 - 6 "" 

5 »r lu bounded by 0.2 s g, s1.0 

where 



10 



W-1 



tut = , conditioned by tilt s 0 and tilt ^ r. 

E ^ 2 (n) 

15 



42. A cellular mobile transmitter/receiver unit as defined in claim 36, wherein 
said band-pass filter comprises a frequency bandwidth located between 5.6 
kHz and 7.2 kHz. 

43. A cellular network element comprising: 

a) a transmitter including an encoder for encoding a wideband 
signal and a transmission circuit for transmitting the encoded wideband 
signal; and 
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b) a receiver including a receiving circuit for receiving a 
transmitted encoded wideband signal and a decoder as recited in claim 7 for 
decoding the received encoded wideband signal. 

44. A cellular network element as defined in claim 43, wherein said noise 
generator comprises a random noise generator for producing a white noise 
sequence whereby said spectral shaping unit produces a spectrally-shaped 
white noise sequence. 

45. A cellular network element as defined in claim 44, wherein said spectral 
shaping unit further comprises: 

a) a gain adjustment module, responsive to said white noise 
sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 

b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 
bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 
version; and 

c) a band-pass filter responsive to said filtered scaled white noise 
sequence for producing a band-pass filtered scaled white noise sequence 
to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 
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46. A cellular network element as defined in claim 45, further comprising: 

a) a voicing factor generator responsive to said adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to said 
gain adjustment module; 

b) an energy computing module responsive to said excitation 
5 signal for calculating an excitation energy for forwarding to said gain 

adjustment module; and 

c) a spectra! tilt calculator responsive to said synthesized signal 
a,. for calculating a tilt scaling factor for forwarding to said gain adjustment 

module; 

1 0 wherein said first subset of said shaping parameters comprises said voicing 
factor, said energy scaling factor, and said tilt scaling factor, and wherein 
said second subset of said shaping parameters includes linear prediction 
coefficients. 

15 47. A cellular network element as defined in claim 46, wherein said voicing 
factor generator comprises a means for calculating said voicing factor r v 
using the relation: 

r v = (E v - E c ) I (E„ + E c ) 

20 

where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 
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48. A cellular network element as defined in claim 46, wherein said gain 
adjusting unit comprises a means for calculating an energy scaling factor 
using the relation: 



5 

Energy scaling factor = 

where is said white noise sequence and u' is an enhanced excitation 
1 0 signal derived from said excitation signal. 

49. A cellular network element as defined in claim 46, wherein said spectral 
tilt calculator comprises a means for calculating said tilt scaling factor g t 
using the relation: 
15 g t = 1 - tilt bounded by 0.2 <; g t ^1.0 

where 



A/-1 

E 

ff=0 

E 

n=0 



E"' 2 C) 



E w 2 (") 



n=0 AM. 



20 



W-1 

tilt = — 



W-1 

E V (n) 

n=0 



, conditioned by f/7f ;> 0 and f//f ;> 
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50. A cellular network element as defined in claim 46, wherein said spectral 
tilt calculator comprising a means for calculating said tilt scaling factor ft 
using the relation: 

5 9 t ^o bounded by 0.2 s ft* 1.0 

where 

1 0 tut = Jul , conditioned by tilt s> 0 and tilt > 

/l=0 

51. A cellular network element as defined in claim 45, wherein said band- 
1 5 pass filter comprises a frequency bandwidth located between 5.6 kHz and 

7.2 kHz. 

52. In a cellular communication system for servicing a large geographical 
area divided into a plurality of cells, comprising: mobile transmitter/receiver 

20 units; cellular base stations, respectively situated in said cells; and control 
terminal for controlling communication between the cellular base stations: 
a bidirectional wireless communication sub-system between each 
mobile unit situated in one cell and the cellular base station of said one cell, 
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said bidirectional wireless communication sub-system comprising, in both the 
mobile unit and the cellular base station: 

a) a transmitter including an encoder for encoding a wideband 
signal and a transmission circuit for transmitting the encoded wideband 
signal; and 

b) a receiver including a receiving circuit for receiving a 
transmitted encoded wideband signal and a decoder as recited in claim 7 for 
decoding the received encoded wideband signal. 

53. A bidirectional wireless communication sub-system as defined in claim 

52, wherein said noise generator comprises a random noise generator for 
producing a white noise sequence whereby said spectral shaping unit 
produces a spectrally-shaped white noise sequence. 

54. A bidirectional wireless communication sub-system as defined in claim 

53, wherein said spectral shaping unit further comprises: 

a) a gain adjustment module, responsive to said white noise 
sequence and a first subset of said shaping parameters, for producing a 
scaled white noise sequence; 

b) a spectral shaper for filtering said scaled white noise sequence 
in relation to a second subset of said shaping parameters comprising 
bandwidth expanded synthesis filter coefficients to produce a filtered scaled 
white noise sequence characterized by a frequency bandwidth generally 
higher than a frequency bandwidth of said over-sampled synthesized signal 
version; and 
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c) a band-pass filter responsive to said filtered scaled white noise 
sequence for producing a band-pass filtered scaled white noise sequence 
to be subsequently injected in said over-sampled synthesized signal version 
as said spectrally-shaped white noise sequence. 

5 55. A bidirectional wireless communication sub-system as defined in claim 

54, further comprising: 

a) a voicing factor generator responsive to said adaptive and 
innovative codevectors for calculating a voicing factor for forwarding to said 
gain adjustment module; 
10 b) an energy computing module responsive to said excitation 

signal for calculating an excitation energy for forwarding to said gain 
adjustment module; and 

c) a spectral tilt calculator responsive to said synthesized signal 
for calculating a tilt scaling factor for forwarding to said gain adjustment 
15 module; 

wherein said first subset of said shaping parameters comprises said voicing 
factor, said energy scaling factor, and said tilt scaling factor, and wherein 
said second subset of said shaping parameters includes linear prediction 
coefficients. 

20 

56. A bidirectional wireless communication sub-system as defined in claim 

55, wherein said voicing factor generator comprises a means for calculating 
said voicing factor r v using the relation: 

r v = (E v - E c ) I (E v + E c ) 
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where E v is the energy of a gain-scaled version of the pitch codevector and 
E c is the energy of a gain-scaled version of the innovative codevector. 

57. A bidirectional wireless communication sub-system as defined in claim 
55, wherein said gain adjusting unit comprises a means for calculating an 
energy scaling factor using the relation: 



where W is said white noise sequence and u' is an enhanced excitation 
signal derived from said excitation signal. 

58. A bidirectional wireless communication sub-system as defined in claim 
55, wherein said spectral tilt calculator comprises a means for calculating 
said tilt scaling factor g t using the relation: 



5>' 2 (") 



Energy scaling factor = 




n=0 AM. 



E «"' 2 <") 



g, = 1 - tilt 



bounded by 0.2 sg,i1.0 



where 



tilt = 



, conditioned by tilt ^ 0 and tilt z r^ 
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59. A bidirectional wireless communication sub-system as defined in claim 
55, wherein said spectral tilt calculator comprising a means for calculating 
said tilt scaling factor g t using the relation: 

5 

g '~ 1u bounded by 0.2 <; g t *1.0 



where 



10 J2s h (n) s h (n-1) 



tut = — . conditioned by tilt :> 0 and tilt * 

E *„ 2 (n) 



15 



60. A bidirectional wireless communication sub-system as defined in claim 
54, wherein said band-pass filter comprises a frequency bandwidth located 
between 5.6 kHz and 7.2 kHz. 
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